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Chapter  One-  Introduction  and  Statement  of  the  Problem 


Section  1.1-  Introduction 


"The  problems  of  radar  clutter  have  been  discussed  in  all  the  basic 
radar  texts,  and  remain  the  subject  of  extensive  current  research. 
Interest  in  clutter  is  almost  universal  in  radar  because  most  of  the 
echo  signals  received  at  the  radar  originate  not  from  the  desired  target 
but  from  surrounding  objects  or  surfaces  which  tend  to  mask  the  target 
signal.  The  technologies  of  MTI,  doppler  resolution,  and 
high-resolution  pulse  compression  have  been  driven  by  the  need  to 
separate  targets  from  the  clutter  background,  and  the  basic  decisions  in 
radar  system  engineering  are  dependent  upon  the  details  of  the  clutter 
environment  in  which  the  radar  is  expected  to  operate." 

-  David  K.  Barton  [1] 


Clutter  has  been  defined  as  a  conglomeration  of  unwanted  radar 
echoes  [2] .  The  undesirability  of  such  echoes  is  related  to  the 
application  of  the  radar.  To  airborne  radar  in  search  of  aircraft, 
clouds,  terrain,  and  man-made  structures  are  clutter  sources.  However, 
to  airborne  ground-mapping  radar  and  meteorological  radar,  targets  of 
interest  include  terrain  and  clouds.  Aircraft  echoes  are  extraneous 
returns  to  these  types  of  radar  and  are  thus  clutter.  The  terms  clutter 
and  backscatter  will  be  used  interchangably  in  this  report. 

The  dependence  of  radar  system  design  upon  the  statistics  of  local 
clutter  is  the  fundamental  motivation  of  the  present  study.  Whereas 
forecasting  techniques  which  are  based  on  theoretical  terrain  models 
exist,  it  is  desirable  that  more  accurate  clutter  prediction  methods  be 
developed  and  applied  to  such  design  problems.  In  particular,  a 
prognostication  of  the  clutter  environment  of  a  land-based,  search  radar 


formed  on  information  supplied  by  topographic  maps  would  be  advantageous 
in  both  the  selection  of  future  radar  sites  and  the  eventual  design  of 


the  radar  system. 

In  the  search  for  a  relationship  between  clutter  and  topographic 
features,  the  ground  clutter  received  by  land-based,  search  radar  is  of 
prime  concern.  The  main  constituents  of  ground  clutter  are  terrain 
echoes,  returns  from  man-made  structures,  and  reflections  from  bodies  of 
water.  This  type  of  clutter  may  be  divided  into  two  categories:  1) 
returns  from  point  scatterers,  and  2)  returns  from  distributed 
scatterers.  A  point  scatterer  is  an  object  whose  entire  volume  may  be 
wholly  contained  in  a  single  resolution  cell  of  the  radar.  In  contrast 
to  the  point  scatterer,  the  distributed  scatterer  comprises  many 
individual  scatterers  within  a  resolution  cell  so  that  this  type  of 
scatterer  is  useful  in  describing  objects  which  are  large  in  comparison 
to  the  radar  resolution.  Common  examples  of  point  scatterers  and 
distributed  scatterers  are  water  towers  and  mountainsides, 
respectively. 

Radar  design  decisions  are  sensitive  to  the  projected  clutter 
environment.  A  basic  tenet  of  radar  system  design  is  the  necessity  to 
reduce  clutter  echoes  while  maintaining  or  enhancing  target  echoes.  The 
objective  of  this  principle  is  the  optimization  of  radar  performance  in 
regard  to  target  discernment  by  either  human  operators  or  automatic 
radars.  Clutter  is  a  much  greater  problem  in  automatic  radar  than  in 
manned  radar.  The  automatic  radar  must  be  designed  to  self-adjust  to 
the  clutter  statistics  of  each  resolution  cell  so  as  to  maintain  control 
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over  the  system's  false  alarm  rate.  For  this  reason,  the  clutter 
environment  must  be  known  if  radar  performance  is  to  be  satisfactory. 
Clutter  statistics  can  also  greatly  influence  the  selection  of  the 
fundamental  radar  features  and  parameters  that  may  be  employed  for 
clutter  reduction.  These  features  include  moving  target  indication 
(MTI),  small  resolution  cell  size,  multiple  beams  in  elevation,  swept 
gain,  and  so  on. 

An  accurate  clutter  forecast  of  a  future  radar  site  from  the 
surrounding  topographic  features  would  supply  information  to  the  radar 
designer  which  may  prove  to  be  indispensible.  The  determination  of 
clutter  forecasting  techniques  involves  the  invention  of  accurate 
clutter  models.  Two  approaches  to  determining  clutter  models  may  be 
taken.  The  first  involves  the  description  of  terrain  to  be  used  as  a 
basis  in  formulating  a  scattering  model  which  may  include  such 
parameters  as  ground  reflectivity  and  doppler  spread.  The  scattering 
model  would  then  be  used  to  predict  the  clutter  environment  of  existing 
radar.  This  prediction  would  be  compared  for  accuracy  to  the  real 
clutter  environment.  The  second  approach  to  this  problem  involves  a 
different  tack.  Instead  of  beginning  with  topographic  features  and  then 
proposing  a  clutter  model,  this  second  method  analyzes  clutter  returns 
from  areas  of  unknown  topography.  Features  of  the  clutter  returns  are 
determined  and  areas  described  so  that  relationships  between  the  clutter 
and  the  topography  of  the  radar  environment  may  be  sought.  The 
advantage  of  this  approach  is  that  preconceived  ideas  as  to  the 
association  of  topography  and  clutter  can  not  influence  the  final 
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determination.  This  is  the  approach  taken  in  this  thesis. 

The  traditional  approach  to  clutter  analysis  has  been  to  measure 
the  backscatter  return  from  different  types  of  terrain  [3,4,5).  Quite 
often,  an  effective  radar  cross-section  or  a  reflectivity  coefficient 
was  assigned  to  characterize  the  terrain  types.  However,  these 
measurements  have  yielded  results  which  are  sensitive  to  the  physical 
environment  and  the  system  parameters  of  the  radar.  For  example,  the 
difference  in  backscatter  measurements  for  a  section  of  terrain  measured 
when  dry  and  then  measured  again  shortly  after  being  irrigated  has 
proven  to  be  as  great  as  5-7dB  [6).  Also,  wind  velocity,  when  changed 
sufficiently,  can  cause  large  swings  in  backscatter  measurements  taken 
from  vegetated  areas  such  as  forests  or  wheat  fields  [7]. 

The  inconsistency  of  terrain  backscatter  measurements  has  prodded 
researchers  to  attempt  to  gain  a  greater  understanding  of  clutter 
through  the  analysis  of  actual  returns.  Haykin  has  published  papers  on 
the  spectral  classification  of  clutter.  The  spectral  analysis  employed 
the  maximum  entropy  method  to  compute  doppler  spectra  of  radar  clutter 
[8].  Also,  Haykin  has  attempted  to  classify  clutter  according  to  a 
one-dimensional  spectral  analysis  by  transforming  the  clutter  into  a 
one-dimensional  sequence  [9],  In  addition  to  these  studies,  many 
researchers  have  attempted  to  characterize  the  statistical  nature  of 
clutter  [10,11].  The  traditionally  accepted  clutter  distribution  was 
the  Rayleigh  distribution.  However,  recent  publications  have  opposed 
this  view  [12].  The  realization  that  clutter  may  not  be  strictly 
Rayleigh  distributed  has  come  from  the  analysis  of  clutter  data. 
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Section  1 .2-  Statement  of  the  Problem 

The  specific  problem  which  is  considered  in  this  thesis  is  the 
definition  and  location  of  homogeneous  regions  in  radar  clutter.  The 
definitions  of  homogeneity  that  are  presented  in  this  thesis  are  tested 
on  two  data  sets. 

Section  1.3-  Discussion  of  the  Problem 

It  is  necessary  to  define  regions  in  radar  clutter  that  are,  in 
some  sense,  homogeneous  so  that  the  radar  environment,  which  is 
generally  quite  extensive  and  complex,  may  be  divided  into  its  component 
parts.  The  existence  of  such  homogeneous  regions  seems  intrinsic  to  the 
topography-clutter  relationship  problem.  However,  the  relationship  may 
involve  correspondence  other  than  that  suggested  by  considering 
topography  alone,  since  this  study  considers  the  clutter  itself  as  the 
basis  of  homogeneity.  A  definition  of  homogeneity  based  on  clutter 
returns  from  the  radar  environment  may  not  involve  the  types  of 
homogeneity  that  topography  may  infer,  e.g.,  forests,  cornfields,  and 
mountains.  It  is  conceivable  that  a  homogeneous  region  defined  on  a 
clutter  return  map  may  not  be  homogeneous  with  respect  to  its 
topography. 

The  work,  which  has  this  thesis  as  a  result,  involved  the  analysis 
of  three  data  sets  supplied  by  Rome  Air  Development  Center  (RADC).  The 
first  is  a  clutter  intensity  plot  of  the  Utica,  New  York  area.  The 


Utica  clutter  map  is  81  degrees  in  azimuth  and  ranges  from  9  kilometers 
to  27  kilometers  from  the  radar  site.  The  second  data  set  consists  of 


ten  statistically  independent  clutter  plots  of  an  area  southwest  of 
Rome,  New  York.  These  clutter  maps  were  recorded  at  ten  different 
frequencies.  The  region  is  located  between  azimuths  208  degrees  and  232 
degrees,  and  ranges  from  14-20  nautical  miles  in  relation  to  the  radar 
at  RADC.  This  area  is  dominated  by  two  hills,  West  Stockbridge  Hill  and 
Eaton  Hill.  The  third  data  set  was  taken  from  roughly  the  same  area. 
It  encompasses  clutter  returns  within  azimuths  205  degrees  and  233 
degrees,  and  ranges  14-20  nautical  miles.  The  third  data  set  was 
coherently  detected  so  that  doppler  processing  could  be  performed  on  the 
clutter  measurements.  Two  definitions  of  homogeneity  have  arisen  from 
the  analysis  of  these  three  data  sets. 

The  first  definition  of  homogeneity  is  based  on  a  two-dimensional 
power  spectral  analysis  of  the  clutter  data.  Viewing  the  clutter 
returns  as  a  two-dimensional  signal  in  range  and  azimuth,  it  was 
reasoned  that  any  area  which  yields  a  white,  two-dimensional  spectrum 
should  be  homogeneous.  Such  a  definition  would  locate  areas  which 
either  possess  randomly  distributed  clutter  intensities,  or  a  single 
point  scatterer  as  the  main  source  of  radar  leturn.  These  point 
scatterers  may  be  located  by  methods  being  developed  by  William  Ladew  in 
a  concurrent  study  and  thus  are  removable  so  that  the  first  definition 
of  homogeneity  corresponds  only  to  the  random  intensity  areas. 

The  second  definition  of  homogeneity  requires  the  ten  different 
frequency  sweeps  provided  by  the  second  data  set.  It  is  a  definition 
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ident ical 


which  is  based  on  regions  which  possess  point-by-point 
populations,  i.  e.,  each  point  in  the  region  represents  a  random 
variable  with  the  same  distribution  as  every  other  point.  This 
definition  relies  on  the  Kruskal-Wallis  test  for  identical  populations 
[13]. 

Two-dimensional  spectra  were  computed  for  the  Utica  area  and  for  a 
mean  value  version  of  the  West  Stockbridge  Hill  area.  This  processing 
has  led  to  the  discovery  of  several  areas  in  the  Utica  clutter  map  which 
are  homogeneous  under  the  first  definition.  The  test  for  identical 
distributions  was  applied  to  the  ten  statistically  independent  frequency 
sweeps  of  the  West  Stockbridge  Hill  area.  Many  small  homogeneous 
regions  and  several  larger  homogeneous  regions  under  the  second 
definition  were  found  to  exist.  Doppler  spectra  and  the  corresponding 
doppler  spreads  were  computed  and  plotted  so  that  a  doppler  spread  map 
could  be  compared  with  the  homogeneous  regions  which  were  located. 

The  following  is  an  outline  of  the  remainder  of  this  paper. 
Chapter  Two  describes  the  three  data  sets  in  detail.  Chapters  Three  and 
Four  motivate  and  define  the  two  types  of  homogeneity.  Chapter  Five  is 
divided  into  two  sections,  each  discussing  the  results  for  one  type  of 
homogeneity.  Chapter  Six  contains  the  relevant  conclusions  and  presents 
suggestions  for  future  work. 


Chapter  Two-  Description  of  the  Backscatter  Data  Sets 


Section  2.1-  The  Radar 


The  ground  clutter  analyzed  in  this  thesis  consists  of  three  data 
sets.  All  three  sets  were  recorded  with  the  experimental  radar  system 
located  in  Building  106  at  RADC.  This  radar  is  an  experimental  coherent 


L-band  system  with  real  time  operator  control  of  a  wide  range  of  system 
parameters.  While  the  basic  system  is  capable  of  operating  over  1175  to 
1375  MHz,  the  standard  operating  configuration  allows  frequency  agile 
operation  over  1250  to  1280  MHz.  Quadrature  backscatter  data  is 
recorded  through  a  flexible  high-speed  digital  buffer  memory.  The 


system  parameters  that  are  operator  selected  and  computer  controlled  are 
the  following: 


PRF 

Bandwidth 
Coded  waveforms 

Matched  filtering 

Pulse  width 
Frequency 
Burst  mode 
High  speed 
recording  buffer 


166  Hz  to  6000  Hz 

60  MHz  max.  instantaneous  (limited  by  A/D) 
LFM  or  up  to  1024  chip  phase  code  (8  bit 
phase  quantization) 

off-line  array  processor  provides  general 
purpose  matched  filtering  in  near  real  time 
up  to  360  microsecond  (or  6%  duty  cycle) 
programmable  or  operator  controlled 
up  to  15  pulses  per  burst 

10240  complex  words  (10  bit  I  and  Q) 

A/D  converters:  60  MHz  @  6  bits 
20  MHz  @  8  bits 

programmable  recording  window: 

*  start  range 

*  stop  range 

*  start  azimuth 

*  number  of  pulses  per  recording 
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The  TWT  transmitter  can  provide  a  peak  power  of  160  kW.  The 
antenna  used  in  the  L-band  system  is  an  FPS-8  antenna  with  a  2.7  degree 
one-way  azimuthal  beamwidth. 

Section  2.2-  The  Utica  Data  Set 


The  first  set  of  clutter  data  consists  of  envelope-detected  values 
which  are  quantized  to  eight  bits.  There  are  81  recorded  azimuths  with 
240  range  cells  per  azimuth.  The  sector  of  ground  is  located  between 
100  and  180  degrees  (0  degrees  is  true  north)  and  ranges  from  9  km  to  27 
km.  This  area  includes  the  city  of  Utica  and  its  surroundings.  The 
significant  radar  and  recording  parameters  are  listed  below: 


radar  frequency:  1265  MHz 
radar  peak  power:  160  kW 
radar  pulse  width:  0.002  msec 
radar  bandwidth:  1  MHz 
A/D  sample  rate:  2  MHz 
record  range:  9-27  km 
record  azimuth:  100-180  degrees 
3  dB  beamwidth:  3  degrees 
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The  parameters  of  pulse  width,  beamwidth,  and  A/D  sample  rate  may 
be  used  to  determine  the  resolution  cell  size  and  number  of  samples  per 
cell.  The  pulse  width  i  may  be  inserted  in  the  following  equation  to 
obtain  the  range  resolution  R^ : 


R 

r 


c  i  _  (3x10^  m/s)(2x!0  ^s) 
2  2 


300  m. 


(2.1) 


where  c  is  the  speed  of  light  in  free  space.  The  A/D  sample  rate  of  2 
MHz  yields  a  value  of  range  difference  between  samples  as  in  the 
following  equation: 

g 

„  *  3x10  m/s  1  ,c  c-, 

R  =  - t -  x  j  =  75  m.  (2.2) 

r  2x10  cycles/s 


This  corresponds  to  4  samples  per  range  cell  since 


N 


R 


300  m 
75  m 


(2.3) 


In  azimuth,  the  3  dB  beamwidth  is  the  measure  of  resolution.  The  data 
were  recorded  in  steps  of  1  degree  in  azimuth.  Therefore,  the  azimuthal 
resolution  cell  size  is  3  samples.  The  azimuthal  resolution  cell  size 
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in  meters  varies  from 


9  km 


2ird 

360°  X 


3° 


4  7  lm. 


(2.4) 


to 


R,  -  1414  m. 

A 

27  km 


(2.5) 


These  resolution  widths  are  the  arcs  subtended  by  an  angle  of  3 
degrees.  At  18  km,  the  azimuthal  resolution  is  942  meters.  We  may  use 
this  value  as  a  representative  azimuthal  resolution  size.  Therefore, 
the  average  resolution  cell  size  for  the  first  set  of  envelope-detected, 
magnitude  data  is  300  m  in  range  and  942  m  in  azimuth.  The  resolution 
cell  is  oversampled  in  both  range  and  azimuth,  and  consists  of  4  samples 
in  range  and  3  samples  in  azimuth. 

The  Utica  data  set  is  displayed  in  Figure  2.1.  This  radial  graph 
is  a  simulated  PPI  display.  The  darker  areas  of  the  plot  represent  the 
areas  of  higher  clutter  return. 
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Section  2.3-  The  West  Stockbridge  Hill  Data  Set 

The  second  set  of  data  was  recorded  using  a  40  microsecond  LFM 

pulse  waveform.  The  pulse  bandwidth  was  2.5  MHz.  The  motivation  for 

using  the  LFM  waveform  is  that  a  long  pulse  with  a  high  level  of  signal 

energy  may  be  used  to  increase  detection  capability  at  no  expense  to 

range  resolution.  This  is  so  because  the  LFM  pulse  may  be  compressed  by 

using  signal  processing  techniques  to  increase  resolution.  In  this 

case,  the  compressed  pulse  width  given  by  the  following: 

1 

\  =  tv"  us  =  0.564  us,  (2.6) 

c  B  2.5 

where  k  represents  mainlobe  broadening  and  B  is  the  bandwidth  of  the  LFM 
pulse,  corresponds  to  a  range  resolution  given  by: 

c  1 

R  =  ~  =  84.6  m.  (2.7) 

r  2 


As  for  the  first  data  set,  the  3  dB  beamwidth,  and  hence  the  azimuthal 
resolution,  is  3  degrees.  Once  again  the  beam  is  stepped  in  1  degree 
increments  yielding  3  samples  per  azimuthal  resolution  cell. 

The  data  were  recorded  between  azimuths  209  and  232  degrees,  and 
ranges  25.9  and  33.9  km.  The  A/D  sample  rate  was  5  MHz  which  gives  2 
samples  per  range  resolution  cell.  The  overall  resolution  cell  is  3 
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samples  in  azimuth  and  2  samples  in  range.  The  azimuthal  resolution 
width  varies  from  1356  m  at  25.9  km  to  1775  m  at  33.9  km.  The  average 
azimuthal  resolution  is  1566  m,  so  that  the  average  resolution  cell  size 
for  the  second  data  set  may  be  taken  to  be  60  m  in  range  and  1566  m  in 
azimuth. 

The  data  values  were  coherently  detected  and  recorded  in  I 
(in-phase)  and  Q  (quadrature)  format.  Quantization  was  to  23  bits  of 
mantissa  and  7  bits  of  exponent  with  a  sign  bit  for  each.  The  bit 
representation  is  in  two's  complement.  The  decimal  value  of  the 
magnitude  of  I  and  Q  was  the  value  used  in  the  data  analysis. 

The  unique  feature  of  this  second  data  set  is  that  it  provides  ten 
statistically  independent  radar  scans  of  an  area  of  ground  southwest  of 
RADC  that  includes  West  Stockbridge  Hill  and  part  of  Eaton  Hill.  The 
statistical  independence  was  obtained  by  frequency  hopping  the  radar 
from  1253  KHz  to  1280  MHz  in  steps  of  3  MHz.  The  separation  in 
frequency  is  sufficient  to  guarantee  statistical  independence  but  is  not 
enough  to  change  the  reflectivity  coefficients  of  the  ground. 
Therefore,  the  ten  numerical  values  associated  with  each  range-azimuth 
sample  may  be  regarded  as  ten  statistically  independent  samples  of  a 
random  variable.  To  ensure  spatial  independence  for  subsequent 
statistical  testing,  one  range-azimuth  sample  from  each  resolution  cell 
was  selected.  This  was  done  by  resampling  at  one-third  the  original 
rate  in  azimuth  and  one-fourth  the  rate  in  range.  Thus,  the  data  per 
cell  are  reduced  to  ten  statistically  independent  samples  of  a  random 
variable,  which  is  the  value  of  backscatter  magnitude  return  from  each 


cell. 


This  is  the  data  set  used  to  identify  homogeneous  regions  under 


the  second  definition  of  homogeneity. 

A  version  of  the  West  Stockbridge  Hill  data  set  is  displayed  in 
Figure  2.2.  This  data  set  is  a  mean  value  per  resolution  cell 
representation  of  the  original  data  set. 

Section  2.4-  The  Coherently  Detected  Stockbridge  Data  Set 

The  third  set  of  data  was  recorded  using  a  6.4  microsecond,  5  MHz 
bandwidth,  LFM  pulse.  The  pulse  compression  was  done  off-line  as  for 
the  second  data  set.  The  weighting  used  was  Taylor  weighting  [14]  with 
6  equi-amplitude  sidelobes  40  dB  down  from  the  main  lobe.  A  total  of 
ten  azimuths  were  recorded  from  205  to  232  degrees  in  3  degree 
increments.  This  means  that  each  recorded  azimuth  is  independent  of 
every  other  azimuth.  Each  azimuth  was  divided  in  range  into  six  range 
windows,  each  consisting  of  50  range  samples.  The  50  samples  were 
computed  from  the  50  samples  in  the  window  plus  30  samples  preceding  the 
window.  This  computation  was  performed  by  the  aforementioned  pulse 
compression  routine. 

Now,  each  range  window  in  each  azimuth  is  illuminated  with  128 
pulses  at  a  pulse  interval  of  6  milliseconds.  These  pulses  are 
generated  and  detected  coherently  so  that  their  echoes  may  be  used  to 
estimate  the  relative  vector-sum  velocity  of  the  scatterers  in  an  area 
of  ground  corresponding  to  the  resolution  cell,  which  is  the  same  size 
as  for  the  second  data  set.  The  samples  per  cell,  however,  are  one  in 
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azimuth  and  two  in  range.  Each  pulse  is  sampled  at  every  point  in  the 
range  window  so  that  the  data  for  one  range  window  after  compression  is 
a  two-dimensional  array,  50  samples  by  128  samples.  The  magnitude 
quantization  for  the  third  data  set  is  the  same  as  for  the  second  data 
set . 
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Chapter  Three-  The  Rough  Clutter  Definition 


This  chapter  describes  the  motivation,  methodology,  and  results  for 
the  first  definition  of  homogeneity  presented  in  this  thesis. 

Section  3.1-  The  Motivation 

Radar  ground  clutter  may  be  viewed  as  a  two-dimensional  signal  when 
the  magnitude  of  the  returns  are  represented  as  a  function  of  range  and 
azimuth.  While  the  radar  is  in  operation,  the  antenna  is  rotated  in 
azimuth,  and  magnitude  return  measurements  are  made  at  finite  range 
intervals.  This  scanning  process  may  be  regarded  as  a  two-dimensional 
convolution  of  the  antenna  beam  pattern  with  the  ground  reflectivity. 
The  two-dimensional  signal  generated  by  this  convolution  represents  the 
interaction  of  the  radar  beam  with  the  ground  surrounding  the  radar. 

Now,  if  the  ground  reflectivity  within  a  particular  region  varies 
erraticly  from  resolution  cell  to  resolution  cell,  one  would  expect  the 
two-dimensional  clutter  sequence  which  represents  the  area  to  appear  as 
a  rough  surface.  Furthermore,  this  type  of  region  should  possess  a 
two-dimensional  spectrum  that  is  irregularly  structured,  i.  e.,  the 
two-dimensional  spectrum  is  an  uneven  surface  which  flucuates  around  a 
constant  level.  If  the  spectra  of  an  infinite  number  of  such  sequences 
were  averaged,  then  this  irregular  structure  would  not  appear.  The 
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averaged  spectrum  would  be  perfectly  flat,  or  white.  However,  since 
this  is  not  the  case,  the  power  spectrum  of  a  single,  finite-length 
segment  of  a  rough  clutter  region  will  be  an  unevenly  flucuating 
surface. 

To  detect  this  irregular  structure,  a  test  for  "whiteness"  must  be 
applied  to  the  two-dimensional  spectral  estimates.  A  test  based  on 
runs-above-and-below  the  median  value  was  employed  for  this  purpose 
[15].  A  run  is  defined  as  a  sequence  of  elements  of  the  same  kind, 
i.  e.,  elements  that  are  above  or  below  the  median.  Any  region  whose 
spectrum  passed  this  test  was  said  to  contain  rough  clutter  in  that  the 
clutter  intensity  samples  from  that  region  could  be  represented  by  a 
rough  two-dimensional  surface.  The  statement  of  the  rough  clutter 
definition  is  the  following: 

Definition:  A  region  which  yields  an  irregularly  structured 
two-dimensional  spectrum,  as  determined  by  the  runs-above-and-below  the 
median  test,  is  said  to  be  homogeneous  in  that  the  region  contains  rough 
clutter . 

Section  3.2-  The  Method  of  Spectral  Estimation 

The  classical  method  of  spectral  estimation  [16]  is  the  method  used 
in  this  thesis.  This  method  involves  the  choice  of  a  data  window,  or 
taper,  which  reduces  the  bias  of  the  spectral  estimates  caused  by 
finite-length  data  sequences.  Next,  the  autocorrelation  estimates  of 
the  tapered  data  sequence  are  computed.  This  is  done  by  taking  the 
squared  magnitude  of  the  Fourier  transform  of  the  tapered  data  and 
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inverse  transforming  the  resulting  sequence.  The  periodogram  is  the 
inverse  transform  of  the  autocorrelation  sequence.  However,  to  reduce 
the  effects  of  the  unreliable  autocorrelation  estimates  at  large  values 
of  time  delay,  a  lag  window  is  applied  to  the  autocorrelation  sequence. 

The  two-dimensional  extension  of  the  classical  method  of  spectral 
estimation  is  based  on  the  two-dimensional  discrete  Fourier  transform 
(DFT),  whose  implementation  is  seen  to  comprise  the  iterated  application 
of  a  one-dimensional  fast  Fourier  transform  (FFT)  algorithm.  The 
following  text  presents  this  statement  more  formally. 

The  two-dimensional  DFT  of  a  sequence,  x(m,n),  is  given  by 

M-l  N-l 

X(k.Jl)  =  I  I  x  (m,n)W™  w“  (3.1) 

m=0  n=0 

where  w 
M 

WN 

Now,  equation  (3.1)  may  be  written  as 
N-i 

X(k,H)  =  I  G(k,n)W~n,  (3.2) 

n=0 


.  2-n 
=  e~J 

.  2t r 

=  e“J  ’ 


where 


G(k,n) 


M-l  , 

I  x(m,n)W 


The  function,  G(k,n),  is  seen  to  be  the  result  of  an  application  of  a 
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two-dimensional  transform  to  each  column  of  x(m,n).  Then,  by  applying 
the  one -dimensions  1  transform  to  each  row  of  G(k,n),  the  two-dimensional 
transform  of  x(m,n)  is  obtained. 

The  above  procedure  is  implemented  through  the  use  of  an  FFT 
subroutine.  The  data  taper  was  chosen  to  be  a  raised  cosine  of  width 
equal  to  18.75%  of  the  data  sequence,  or  2  points.  (Note:  The  DC  level 
is  removed  prior  to  tapering  by  subtracting  a  computed  average  value  for 
each  of  the  two-dimensional  data  segments  to  be  transformed.)  The  taper 
was  applied  to  each  of  the  four  edges  of  the  two-dimensional 
finite-length  sequence  which  was  sixteen  points  in  both  dimensions. 
This  size  data  sequence  was  chosen  after  some  experimentation  as  the 
minimum  length  that  would  give  reasonably  good  results.  The  lag  window 
was  chosen  to  be  a  Hanning  window  [16)  of  width  equal  to  the  length  of 
the  data  sequence.  The  two-dimensional  lag  window  was  generated  by 
rotating  the  appropriate  one-dimensional  window  about  the  origin  of  the 
two-dimensional  lag  plane  [17]. 

Section  3.3-  The  Test  for  Irregular  Structure 

The  final  set  of  power  spectrum  estimates  are  correlated  with  one 
another  because  of  the  convolution  of  the  actual  spectrum  and  the 
spectral  lag  window.  Since  the  runs-above-and-below  the  median  test  for 
randomness  assumes  independence  of  samples,  this  spectral  correlation 
must  be  dealt  with  in  an  appropriate  manner.  The  two-dimensional 
equivalent  noise  bandwidth  of  the  lag  window  was  selected  as  a  measure 
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of  the  spectral  correlation  [16].  The  two-dimensional  noise  bandwidth 
calculation  is  presented  in  Appendix  A.  The  bandwidth  is  3.13  spectral 
lines.  Therefore,  the  spectra  were  sampled  on  the  basis  of  every  fourth 
spectral  estimate  in  both  azimuthal  and  range  frequencies.  The 
resulting  set  of  spectral  estimates  was  tested  for  irregular  structure. 

The  test  for  irregular  structure,  or  "whiteness",  of  the 
two-dimensional  spectra  is  based  on  the  runs-above-and-below  the  median 
test  given  in  Hald  [14].  The  test  is  one-dimensional  by  nature  and  in 
order  to  apply  it  to  the  two-dimensional  spectral  data,  the 
two-dimensional  spectral  estimates  must  be  rearranged  into  a 
one-dimensional  sequence  (whereas  this  procedure  does  not  affect  the 
level  of  significance  of  the  test,  the  sensitivity  to  alteratives  may  be 
affected,  as  will  be  discussed  later).  This  was  done  by  appending 
columns  of  constant  azimuthal  frequency  to  the  next  column  of  higher 
azimuthal  frequency.  The  resulting  one-dimensional  sequence  was  tested 
at  the  5%  level  of  significance  using  the  number  of  runs  (>11.9)  as  the 
measure  of  significance.  The  number  of  runs  which  is  significant  may  be 
computed  as  in  [15].  Because  of  symmetry  in  the  two-dimensional 
spectrum,  only  half  the  spectral  estimates  are  needed  in  the  test. 

Section  3.4-  The  Calculation  of  Spectral  Spreads 

The  spectra  which  are  computed  in  order  to  locate  homogeneous 
regions  have  been  described  by  their  two-dimensional  spectral  spreads. 
The  two-dimensional  spectral  spreads  are  computed  by  projecting  the 
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spectral  lines  onto  each  frequency  axis  and  then  performing  a  variance 
calculation  on  the  resulting  function.  This  is  presented  more  formally 
in  the  following. 


The  equation  for  the  azimuthal  spread  is 
N  N 

5:  [  I'  X(i,j)j(i-ir 

2  i=l  _J  =  1 

°A  N  N 

Z  T.  X(i,J) 
i-lj-1 


(3.3) 


(3.4) 


where  N  is  the  number  of  spectral  lines  along  one  dimension.  The 
resulting  numerical  values  are  in  units  of  spectral  lines  squared. 


Section  3.5-  Description  of  Results 


The  format  of  the  two-dimensional  data  plots  and  spectral  plots, 
which  appear  at  the  end  of  Chapter  Five,  is  given  in  this  section. 

First,  the  three-dimensional  representation  of  the  backscatter 
intensity  data  sequence  (referenced  by  the  minimum  range  and  azimuth 
sample  appearing  in  the  16  by  16  point  region)  and  the  corresponding 
magnitude  spectrum  are  given.  The  orientation  of  the  range  and 
azimuthal  axes,  as  well  as  the  range  and  azimuthal  frequency  axes,  are 


shown  on  each  plot.  The  frequency  origin  is  located  in  the  center  of 
the  spectral  plane  in  the  spectral  magnitude  plots.  The  azimuthal 
frequency  increment  is  (0.5/16  cycles/degree)  and  the  range  frequency 
increment  is  (2.0/16  cycles/range  cell). 
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Chapter  Four-  The  Isodistributive  Definition 


This  chapter  describes  the  motivation,  methodology,  and  the  results 
for  the  second  definition  of  homogeneity  presented  in  this  thesis. 

Section  4.1-  The  Motivation 

The  West  Stockbridge  Hill  data  set  provides  ten  statistically 
independent  samples  of  backscatter  return  from  each  resolution  cell. 
This  type  of  data  allows  for  the  comparison  of  the  underlying 
probability  distributions  governing  the  backscatter  return  from  each 
cell.  Such  a  comparison  necessarily  involves  the  selection  and 
application  of  some  form  of  statistical  testing  procedure  to  a  group  of 
resolution  cells.  The  choice  of  which  type  of  test  to  use  must  be  based 
on  what  the  user  of  the  test  hopes  to  accomplish.  The  objectives  of  the 
user,  however,  need  not  be  overly  restrictive  as  to  the  capabilities  of 
the  test.  In  the  present  case,  it  is  desired  that  the  test  be  capable 
of  finding  contiguous  groups  of  resolution  cells  whose  sets  of  ten 
backscatter  samples  are  consistent  with  a  null  hypothesis  of  identical 
populations.  As  will  eventually  be  seen,  the  criterion  of  contiguity  is 
imposed  for  convenience  of  processing,  and  is  not  a  restriction  of  the 
selected  test. 

In  defining  regions  according  to  their  topography,  it  seems 
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reasonable  that  a  region  whose  resolution  cells  contain  comparatively 
similar  ground  types  should  be  deemed  homogeneous.  The  similarity  is  at 
best  subjective.  It  may  be  that  "resolution  cells  of  similar  ground 
types"  may  include  cells  which  subdivide  a  forested  lot,  cultivated 
land,  an  urban  area,  or  gently  rolling  hills.  In  addition  to  this,  it 
seems  fair  to  propose  that  areas  of  similar  ground  types  will  exhibit 
backscatter  return  distributions  of  a  kind.  However,  these  simple 
examples  of  ground  types  may  not  be  of  a  sufficiently  specific  nature. 
It  is  quite  probable  that  such  factors  as  slope,  soil  type,  and  other 
less  obvious  characteristics  of  the  ground  that  affect  backscatter 
properties  may  be  significant  in  determining  what  regions  may  be 
identified  as  "similar". 

Taking  the  above  into  account,  it  is  apparent  that  many  different 
probability  distributions  might  arise  in  the  measurement  of 
backscatter.  Therefore,  it  seems  both  appropriate  and  prudent  to  use  a 
statistical  test  which  does  not  require  prior  knowledge  of  the 
probability  distribution  from  which  the  samples  to  be  tested  are  drawn. 
It  is  for  this  reason  that  the  type  of  statistical  test  chosen  was  a 
non-parametric  test.  Furthermore,  the  specific  test  used  should  be  a 
multi-sample  test  since  the  number  of  resolution  cells  that  should  be 
considered  in  one  test  run  is  undetermined  a  priori.  Indeed,  it  may  be 
necessary  to  vary  the  number  of  included  cells  so  as  to  locate  the 
largest  regions  which  pass  the  test.  The  test  described  in  the  next 
section  meets  the  above  requirements  and  was  used  to  locate  regions 
which  correspond  to  the  following  definition  of  homogeneity: 
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Definit ion :  A  contiguous  set  of  resolution  cells  which  exhibit  no 
significant  differences  in  the  distribution  of  clutter  are  said  to  be 
homogeneous . 


Section  4 .2-  The  Kruskal-Wal 1  is  Test 


The  non-parametr ic  statistical  test  chosen  was  the  Kruskal-Wal 1 is 
test  for  identical  populations  which  is  sensitive  to  unequal  locations 
[13]. 

The  test  is  performed  by  forming  a  table  whose  entries  are  the 
observations  of  backscatter.  Each  column  of  the  table  contains  the  ten 
backscatter  observations  of  one  particular  resolution  cell.  The  order 
in  which  the  backscatter  values  of  a  cell  are  placed  in  the  column  is 
not  important,  but  the  backscatter  values  from  a  cell  must  all  be 
entered  in  one  column  of  the  table.  Now,  the  values  of  backscatter 
return  are  replaced  in  the  table  by  their  numerical  ranks  from  lowest  to 
highest  with  'l'  corresponding  to  the  lowest  value  of  backscatter.  The 
ranks  are  assigned  by  considering  every  observation  to  be  from  a  single 
pool  of  samples  so  that  ranks  ascend  in  order  from  1  to  N,  where  N  is 
the  total  number  of  observations  in  the  group  of  cells  tested.  The  rank 
sums  for  each  column  of  the  table  are  computed  and  their  squares  are 
used  in  calculating  a  statistic,  H,  according  to  the  following  equation: 


H 
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N (N+l )  Rl(Tl 


N+l  2 
2  ;  ’ 


(4.1) 


where  N»  total  number  of  observations 

O  number  of  columns  (or  cells) 

R.=  number  of  observations  in  column  i 
1 

T.=  rank  sum  for  column  i 
1 

T^=  T^/R^,  the  mean  value  of  ranks  in  column  i 


This  equation  is  reducible  to  a  form  more  conducive  to  calculation: 


H  =  -3 (N+l)  + 
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The  test  assumptions  that,  along  with  the  null  hypothesis,  are 
necessary  to  give  the  test  statistic,  H,  its  null  distribution  are  1) 
sampling  is  random,  2)  sampling  is  with  individual  replacement,  and  3) 
there  are  no  tied  observations. 

The  Stockbridge  Hill  data  set  meets  the  above  assumptions.  The 
data  per  resolution  cell  are  random  samples  insofar  as  all  the  possible 
combinations  of  ten  backscatter  values  have  equal  a  priori  probabilities 
of  occurrence  in  each  resolution  cell.  Hence,  the  requirement  of  the 
first  assumption  is  satisfied.  The  second  assumption  is  met  even  though 
the  data  are  quantized.  This  is  true  since  any  quantum  may  be 
repeated.  Now,  it  is  possible  that  quantization  could  cause  problems 
with  the  requirement  imposed  by  the  third  assumption.  However,  no  tied 
observations,  i.  e.,  no  observations  of  equal  value,  were  noted  during 
the  actual  tests.  If  such  tied  observations  had  occurred,  there  exist 
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methods  for  properly  handling  those  instances  [18]. 

An  advantage  in  using  H  as  the  test  statistic  is  that  its 
distribution  may  be  closely  approximated  by  the  chi-squared  distribution 
in  the  case  where  R^=R,  i.  e.  all  the  R^  are  equal.  The  approximation 
improves  as  R  increases.  In  our  case,  R=10  and  the  chi-squared 
distribution  is  a  very  good  approximation  to  the  null  distribution  of  H 
as  can  be  seen  in  Appendix  B  which  contains  an  explanation  of  why  the 
distribution  of  H  may  be  approximated  by  the  chi-squared  distribution. 

The  Kruskal-Wall is  test,  as  described  above,  was  applied  to  a 
version  of  the  West  Stockbridge  Hill  data  which  was  formed  by  spatially 
subsampling  the  original  data  set.  The  cells  were  selected  on  the  basis 
of  every  third  sample  in  azimuth  and  every  second  sample  in  range  so 
that  one  set  of  ten  samples  from  each  resolution  cell  of  the  radar 
appeared  in  the  data.  The  subsampling  was  done  to  ensure  spatial 
independence  on  the  bases  of  3  dB  beamwidth  and  range  resolution.  The 
resulting  data  set  comprised  eight  azimuths  with  128  range  samples  per 
azimuth. 

Section  4.3-  Description  of  Results 

The  regions  tested  were  of  varying  sizes.  First,  a  group  of  four 
resolution  cells,  two  in  azii  uth  and  two  in  range,  was  selected  as  a 
"window".  Next,  the  length  in  range  was  doubled  three  times,  so  that 
windows  of  2x4,  2x8,  and  2x16,  were  used.  The  windows  were  passed  over 
the  data  in  a  non-overlapping  fashion  and  the  resolution  cells  appearing 


in  each  window  were  tested  against  one  another  for  similarity  in  their 
backscatter  distributions.  The  results  for  the  windows  utilized  are 
shown  in  Figures  5. 2. 2-5. 2. 5.  Azimuth  is  the  horizontal  axis  and  range 
is  the  vertical  axis  in  these  plots.  The  crossed  areas  correspond  to 
windows  whose  enclosed  resolution  cells  passed  the  Kruskal-Wallis  test. 
These  crossed  regions  represent  homogeneous  regions  under  the  second 
definition.  Figure  5.2.6  presents  the  results  of  a  test  in  which  a 
2-cell  by  2-cell  window  was  slid  over  the  data  so  that  successive  tests 
overlapped  in  range  and  azimuth.  This  is  a  method  for  testing  all 
combinations  of  four  contiguous  cells  in  a  2-cell  square  windo\.  r\gure 
5.2.6  shows  large  regions  which  were  selected  for  testing  by  looking  for 
concentrations  of  small  homogeneous  regions  in  Figures  5. 2. 1-5. 2. 5. 
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Chapter  Five-  Presentation  and  Discussion  of  Results 


Section  5.1-  The  Rough  Clutter  Definition 

The  results  for  the  rough  clutter  definition  of  homogeneity  are 
presented  in  Figures  5.1.3-5.1.83,  which  are  located  at  the  end  of  this 
section.  In  each  figure  the  coordinates  (e.g.  "5X113")  of  the  area 
tested  are  shown.  Figure  5.1.1  is  a  three-dimensional  plot  of  the  Utica 
clutter  intensity  data.  Figure  5.1.2  is  a  three-dimensional  plot  of  the 
West  Stockbridge  Hill  mean  value  data  set. 

The  two-dimensional  power  spectral  estimation  routine  was  applied 
to  areas  in  the  Utica  clutter  map  which  were  chosen  as  being 
characteristic  of  the  overall  map.  The  representative  areas  selected 
fall  into  four  categories  which  are  based  on  their  backscatter  intensity 
plots.  First,  regions  of  high  intensity  backscatter  are  depicted  in 
Figures  5. 1.3-5. 1.8.  Figure  5.1.9-5.1.19  shows  eleven  areas  in  which 
the  greater  values  of  intensity  are  concentrated  in  a  localized 
"mound".  The  level  of  intensity  varies  over  the  eleven  mounded  areas 
presented.  Figures  5.1.20-5.1.26  show  areas  that  represent  the  low 
intensity  backscatter  regions  that  cover  much  of  the  Utica  clutter  map. 
Figures  5.1.27-5.1.28  graphically  depict  the  results  of  the  test  for 
irregularly  structured  spectra.  The  graphs  are  representations  of  the 
sampled  spectra  with  those  values  above  the  median  marked  by  an  "A"  and 
those  below  by  a  "B".  Figures  5.1.30-5.1.81  show  the  areas,  spectra, 
and  corresponding  tests  for  irregularly  structured  spectra  of  the  West 
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Stockbridge  Hill  clutter  map. 

In  Figures  5. 1.3-5. 1.8,  we  see  six  areas  of  the  Utica  clutter  map 
which  represent  the  high  intensity  areas  of  the  map.  The  intensity  for 
these  areas  flucuates  wildly  in  both  azimuth  and  range.  These  regions 
would  appear  to  be  prime  candidates  for  rough  clutter.  However,  as  can 
be  seen  from  the  spectral  plots,  these  areas  comprise  a  few  significant 
frequency  components.  These  frequency  components  suggest  a  marked 
periodicity  in  the  intensity  sequence.  Such  an  intensity  sequence  does 
not  fit  the  rough  clutter  definition,  the  test  for  irregularly 
structured  spectra  indicates  this  fact  clearly  in  that  all  the  high 
intensity  areas  fail  to  be  classified  as  homogeneous  under  the  rough 
clutter  definition. 

The  mounded  (medium  intensity)  areas  are  shown  in  Figures 
5.1.9-5.1.19.  These  areas  yield  spectra  which  are  generally 
well-behaved  surfaces  in  that  they  are  relatively  smooth.  However,  two 
of  the  spectra  for  this  type  of  area  pass  the  test  for  irregular 
structure.  These  observations  may  be  attributed  to  the  one-dimensional 
nature  of  the  runs-above-and-below  the  median  test  and  statistical 
chance.  The  fact  that  the  spectra  for  the  two  "homogeneous"  mounded 
areas  pass  the  test  for  irregular  structure  is  an  example  of  the  test's 
sensitivity  to  alternatives  to  randomness.  Since  these  spectra  are 
highly  concentrated  at  low  frequencies,  short  runs  above  the  median  are 
picked  up  by  the  test  and  an  erroneous  decision  is  made.  Also,  some 
irregular  structure  at  high  frequencies  increases  the  likelihood  that 


more  than  the  critical  number  of  runs  above-and-below  the  median  will  be 
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surpassed . 

The  flucuating  areas  in  Figures  5.1.9-5.1.12  were  thought  to  be 
candidates  for  rough  clutter.  However,  the  test  for  irregularly 

structured  spectra  proved  this  supposition  to  be  inaccurate.  None  of 
these  areas  passed  the  test  for  homogeneity  under  the  rough  clutter 
'  definition.  These  failures  are  due  to  the  fact  that  the  flucuations  are 

predominantly  low  frequency  oscillations. 

The  results  for  the  low  intensity  areas,  shown  in  Figures 
5.1.20-5.1.26  are  interesting  in  that  all  these  areas  are  homogeneous 
under  the  rough  clutter  definition.  All  the  low  intensity  spectra 
passed  the  test  for  irregular  structure  (see  Figure  5.1.28).  This  type 
of  intensity  plot  is  the  kind  which  the  rough  clutter  definition  was 
designed  to  classify  as  rough  clutter.  The  backscatter  intensity 
surfaces  can  be  seen  to  comprise  small,  uneven  disparities  from  sample 
to  sample.  This  uneveness  is  believed  to  arise  from  the  combination  of 
low  intensity  returns  and  the  thermal  noise  in  the  radar  receiver. 

The  two-dimensional  spectral  processing  has  located  no  rough 
clutter  regions  which  are  also  high  backscatter  intensity  areas. 
Although  this  fact  does  not  preclude  the  existence  of  high  intensity 
rough  clutter  regions  in  clutter  maps  other  than  those  analyzed  herein, 
it  has  indicated  two  possible  reasons  for  the  absence  of  such  regions  in 
the  Utica  and  West  Stockbridge  Hill  data  sets:  1)  the  resolution  cell 
size  attenuates  any  high  frequency  components  which  might  be  present  in 
the  clutter  signal,  and  2)  the  data  sequences  represent  areas  that  are 
so  large  that  changes  in  ground  elevation  dominate  the  clutter  signal 
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and  changes  in  ground  cover  do  not  appreciably  affect  the  clutter 
signal . 


To  illustrate  the  effects  of  the  resolution  cell  on  the 
two-dimensional  spectra  of  clutter  regions,  the  two-dimensional  spectrum 
of  an  approximation  to  the  beam  patterm  is  shown  in  Figure  5.1.29. 
Since  the  beam  pattern  is  convolved  with  the  ground,  the  clutter 
spectrum  is  multiplied  by  the  beam  pattern  spectrum.  This  convolution 
of  the  beam  with  the  ground  severely  attenuates  the  higher  clutter 
frequencies . 

The  intensity  and  spectral  plots  for  West  Stockbridge  Hill  are 
shown  in  Figures  5.1.30-5.1.81.  The  results  all  show  that  no  regions  in 
the  Stockbridge  clutter  map  are  homogeneous  according  to  the  rough 
clutter  definition.  This  is  because  the  two  hills,  Stockbridge  and 
Eaton,  dominate  the  map.  The  area  covered  by  the  spectral  estimation  is 
so  large  in  azimuth,  1566  m,  and  so  narrow  in  range,  480  m,  that  the 
hills  dominate  the  frequency  content  of  the  clutter  signal,  so  that 
little  variation  in  the  signal  occurs  in  the  range  dimension,  whereas 
one  full  period  occurs  in  the  azimuthal  direction.  This  essentially 
yields  very  narrow,  low  frequency  concentrated  spectra. 

The  above  discussion  indicates  that  the  area  which  the  spectral 
estimation  considers  as  a  two-dimensional  data  sequence  is  too  large  in 
space.  The  handicap  of  limited  radar  resolution,  coupled  with  the  fact 
that  the  two-dimensional  FFT  performs  poorly  on  short  data  sequences  and 
thus  requires  at  least  sixteen  clutter  samples  in  both  dimensions  for 
good  results,  suggests  that  the  area  required  in  the  spectral  estimation 
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BEAM  PATTERN  APPROXIMATION 


TWO-DIMENSIONAL  SPECTRUM  OF 
BEAM  APPROXIMATION 


TWO -DIMENSIONAL  SPECTRUM  OF 
BEAM  APPROXIMATION  IN  DO 


encompasses  a  ground  surface  which  is  simply  too  extensive  and 
topographically  complex  to  be  considered  homogeneous  by  this 
definition. 

In  Figures  5.1.82  and  5.1.83  ,  the  scatter  diagrams  of  the  Utica 
and  Stockbridge  spectral  spreads  are  shown,  respectively.  The  Utica 
spectral  spreads  show  that  the  low  intensity  areas  which  were  found  to 
be  homogeneous,  tend  to  cluster  at  large  values  of  spectral  spreads. 
Thus,  the  spectral  spreads,  or  bandwidths,  are  useful  in  separating 
these  areas  from  the  mounded,  or  medium  intensity,  areas  which  were 
erroneously  classified  as  homogeneous.  The  Stockbridge  bandwidths  do 
not  exhibit  any  particularly  significant  clusterings.  However,  the 
areas  closer  to  the  radar  (denoted  by  the  alphanumeric  characters 
' 1234567 89 ABCDEFG'  in  increasing  distance  from  the  radar)  tend  towards 
higher  azimuthal  bandwidths  whereas  the  areas  further  from  the  radar 
tend  towards  lower  azimuthal  bandwidths.  This  phenomenon  may  be 
attributed  in  part  to  the  increasingly  poor  azimuthal  resolution  as 
range  increases.  However,  one  can  see  that  the  areas  which  are  farthest 
from  the  radar  (denoted  by  'G')  tend  towards  higher  bandwidths.  These 
areas  are  partially  in  the  shadow  region  of  the  hills  as  were  the 
homogeneous  Utica  areas. 
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SAMPLED  SPECTRA  OF  UTICA  AREAS 
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Section  5.2-  The  Isodistributive  Definition 

In  attempting  to  locate  homogeneous  regions  under  the 
isodistributive  definition,  the  researcher  must  be  aware  that  the 
regions  in  question  may  vary  from  a  minimum  of  two  resolution  cells  to 
as  many  as  are  contained  within  the  limits  of  the  data.  For  the 
independent  cells  of  West  Stockbridge  Hill,  the  total  number  of 
available  window  sizes  is  1023.  Clearly,  a  choice  of  window  size,  or 
sizes,  must  be  made  since  running  the  test  on  the  entire  data  set  for 
that  many  windows  would  be  prohibitive  in  terms  of  time  and  funding. 
Therefore,  a  few  window  sizes  were  used  to  find  whether  there  are  any 
homogeneous  regions  under  the  second  definition  of  homogeneity.  The 
results  presented  in  Figure  5. 2. 1-5. 2. 6,  located  at  the  end  of  this 
section,  show  that  homogeneous  regions  (crossed  areas)  of  varying  sizes 
can  be  found  in  the  clutter  data. 

Figure  5.2.1  shows  homogeneous  regions  that  are  two  resolution 
cells  in  azimuth  by  two  resolution  cells  in  range.  There  were  256  such 
regions  tested.  Figure  5.2.1  illustrates  that  135  of  these  proved  to  be 
homogeneous  under  the  isodistributive  definition.  This  confirms  that 
homogeneous  regions  which  are  6  degrees  wide  in  azimuth  and  120  m  in 
range  exist  in  great  numbers  in  the  West  Stockbridge  Hill  data.  The 
area  covered  by  these  regions  is  over  50%  of  the  total  area. 

In  Figure  5. 2. 1-5. 2. 4,  each  figure  represents  a  doubling  of  the 
window  length  in  range.  It  is  apparent  that  a  smaller  ratio  of 
homogeneous  area  to  total  area  is  obtained  as  the  size  of  the  window 
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increases.  This  is  not  surprising  if  one  considers  that  a  greater 
number  of  cells  per  test  increases  the  probability  that  at  least  one  of 
the  included  cells'  backscatter  will  be  governed  by  a  dissimilar 
probability  distribution. 

One  feature  of  the  results  for  the  second  definition  is 
particularly  interesting.  This  is  that  the  homogeneous  regions  tend  to 
be  located  in  areas  of  mid-range  mean  value  of  backscatter.  Regarding 
Figure  5.2.1  and  Figure  5.1.2,  and  comparing  (allowing  for  distortion 
between  the  plots),  one  can  see  that  the  homogeneous  regions  do  not 
occur  for  the  larger  window  sizes  in  areas  of  high  backscatter 
intensity,  nor  do  they  occur  in  areas  of  low  backscatter  intensity. 
These  regions  of  mid-range  intensity  correspond  to  the  radar-facing 
slopes  of  hills  as  well  as  the  backsides.  No  other  outstanding  feature 
of  the  topography  of  these  regions  is  apparent. 

Figure  5.2.5  shows  regions  which  passed  a  two  cell  by  two  cell 
sliding  window.  The  results  are  plotted  on  a  radial  graph  like  that  of 
Figure  2.2  for  easy  comparison.  The  eight  azimuths  chosen  for 
independence  are  plotted.  The  longest  arcs  are  cells  which  passed  the 
Kruska 1-Wall is  test  in  each  of  the  four  windows  in  which  it  appeared. 
Shorter  arcs  denote  cells  that  passed  three  tests,  two  tests,  and  one 
test,  with  no  arc  at  all  signifying  cells  that  failed  all  possible 
tests.  Cells  that  are  along  the  edges  of  the  data  sequence  can  appear 
in  only  two  tests,  except  for  those  points  located  at  the  very  corners 
which  appear  in  only  one  test. 

The  largest  homogeneous  regions  found  appear  in  Figure  5.2.6. 


These  were  found  by  testing  areas  selected  by  looking  for  areas  in 
Figure  5.2.1,  which  are  dense  in  terms  of  the  cells  that  passed  the  two 
by  two,  sliding  window.  The  areas  which  s! ow  the  greatest  concentration 
of  small  homogeneous  regions  were  suspected  of  larger  scale 
homogeneity.  Again,  these  regions  correspond  to  leading  and  trailing 
edges  of  hills  that  appear  on  the  topographic  maps  of  the  West 
Stockbridge  Hill  area.  As  can  be  seen  in  Figure  5.2.6,  one  area 
stretches  across  four  cells  in  azimuth.  This  region,  in  particular, 
corresponds  to  the  radar-facing  side  of  West  Stockbridge  Hill,  which 
exhibits  a  very  large  gradient,  predominantly  in  the  radial  direction  as 
measured  from  the  radar.  The  same  is  true  of  the  area  that  overlaps  the 
four  azimuth  wide  region,  only  it  is  approximately  twice  the  length  in 
range  as  the  previous  region.  The  remaining  two  regions  correspond  to 
the  backsides  of  two  hills.  The  area  which  spans  azimuths  5  and  6  is  on 
the  backside  of  Stockbridge  Hill.  The  remaining  area  is  on  the  backside 
of  Eaton  Hill,  whose  crest  is  just  outside  the  area  covered  by  the 
data.  Eaton  Hill  is  within  the  minimum  range  of  the  data  and  just 
outside  the  209  degree  azimuth. 

The  results  of  the  second  definition  of  homogeneity  represent 
several  runs  of  the  Kruskal-Wall is  test.  The  results  shown  in  Figure 
5.2.6  were  tested  on  the  basis  of  the  previous  results.  This 
"result-guided"  testing  procedure  was  devised  to  find  the  largest  areas 
which  might  pass  the  test.  It  is  desirable  to  locate  the  largest 
regions  because  these  areas  may  prove  to  be  the  most  useful,  since  areas 
fitting  this  definition  are  homogeneous  regions  within  which  the 
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statistics  of  the  backscatter  in  the  individual  resolution  cells  are 
similar  to  one  another.  This  type  of  information  may  prove  to  be  very 
useful  to  radar  system  designers.  In  particular,  the  knowledge  that 
homogeneous  regions  exist  which  are  a  pulse  length  or  more  in  range 
duration  will  possibly  aid  in  the  design  of  radars  employing  such  signal 
processing  techniques  as  pulse  compression  and  CFAR. 
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Chapter  Six-  Conclusions  and  Suggestions  for  Future  Work 

Section  6.1-  Conclusions 

The  overall  goal  of  this  project  is  to  determine  a  relationship 
between  the  topography  of  the  radar  environment  and  the  clutter  which  is 
perceived  by  either  a  human  observer  or  an  automatic  radar.  In  moving 
towards  this  objective,  the  present  study  has  offered  an  approach  that 
begins  with  the  separation  of  radar  clutter  into  its  components.  These 
components  have  been  designated  to  be  homogeneous  regions. 

Two  definitions  of  radar  clutter  homogeneity  have  been  presented 
and  tested  for  their  applicability  to  two  sets  of  backscatter  data,  the 
Utica  and  West  Stockbridge  Hill  data  sets.  In  keeping  with  Webster's 
definition  of  the  word  "homogeneous"  [19],  the  two  definitions  of  radar 
clutter  homogeneity  indicate  areas  in  which  the  clutter  signal  possesses 
a  uniform  structure  throughout  the  enclosed  area.  The  rough  clutter 
definition  encompasses  clutter  regions  which  are  homogeneous  in  the 
sense  that  the  finite,  two-dimensional  clutter  signal  within  the 
homogeneous  region  represents  a  broken,  uneven  surface.  The 
isodistributive  definition  of  radar  clutter  homogeneity  includes  regions 
which  are  homogeneous  in  the  statistical  sense,  i.e.,  the  backscatter 
measurements  in  every  cell  of  the  region  can  be  modeled  by  a  common 
probability  distribution.  Both  definitions  were  successful  in  locating 
homogeneous  regions.  The  rough  clutter  regions  seem  to  correspond  to 
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ground  areas  which  exhibit  low  intensity  radar  return  which  is  just 
above  the  noise  level  in  the  radar  receiver.  The  isodistributive 
definition  has  located  areas  that  lie  in  the  valleys  between 
high-intensity  regions. 

The  viability  of  the  rough  clutter  definition  as  presented  in  this 
thesis  is  heavily  dependent  upon  the  radar  resolution  as  demonstrated  in 
Section  5.1.  Three  possible  solutions  to  this  problem  remain  to  be 
investigated. 


First,  a 

determination 

of  the  actual 

beam  pattern 

which 

is 

convolved  with 

the  ground 

should  be  made 

so  that  some 

form 

of 

equalization  for  beam  effects  may  be  attempted.  However,  any  promising 
results  which  lie  in  this  direction  rely  on  both  a  beam  pattern  spectrum 
which  is,  at  least,  approximately  invertible,  and  an  insignificant 
degradation  of  the  signal-to-noise  ratio  at  the  higher  two-dimensional 
frequencies  where  beam  pattern  equalization  would  have  a  great  effect  on 
boosting  the  effect  of  receiver  noise. 

Second,  an  increase  of  the  radar  resolution,  so  that  each  recorded 
backscatter  sample  represents  an  area  of  ground  on  the  order  of  a  few 
square  meters,  would  also  increase  the  practicality  of  the  rough  clutter 
definition.  This  approach  requires  an  increase  in  both  range  resolution 
and  azimuthal  resolution.  The  increased  range  resolution  is  generally 
attainable  through  pulse  compression  techniques,  which  are  in  general 
use.  On  the  other  hand,  increasing  the  azimuthal  resolution  of  many 
presently  active  search  radars  would  require  costly  changes  in  their 


antenna  structures. 


The  third  solution  is  to  widen  the  definition  of  homogeneity  in 
regard  to  the  two-dimensional  frequency  content  of  the  clutter  signal. 
It  is  apparent  that  the  rough  clutter  definition  is  very  stringent  when 
applied  to  low  resolution  clutter  returns.  It  may  be  sufficient  to 
define  clutter  regions  in  which  the  spectral  content  includes  a  dominant 
frequency  component  which  is  removed  from  either  or  both  frequency 
axes.  This  type  of  categorical  definition  would  indicate  regions  whose 
clutter  returns  exhibit  significant  flucuations  in  either,  or  both, 
range  and  azimuth.  Such  homogeneous  regions  may  be  useful  to  radar 
designers  who  employ  CFAR  techniques  which  require  a  constant  updating 
of  clutter  rejection  thresholds.  This  type  of  undulating  clutter  region 
would  disclose  the  rate  at  which  the  rejection  threshold  must  be 
updated.  Along  the  same  lines,  some  promise  in  the  application  of 
clustering  algorithms  to  parametric  descriptions  of  the  two-dimensional 
spectra  of  clutter  regions  is  indicated  in  the  results  of  the 
two-dimensional  spectral  spreads  presented  in  Figures  5.1.82  and 
5.1.83.  With  other  spectral  parameters  such  as  energy  content  and  peak 
spectral  component  location  included,  the  parametric  descriptions  of  the 
spectra  might  yield  interesting  results  when  processed  by  clustering 
algorithms. 

The  isodistributive  definition  appears  to  be  useful  in  predicting 
that  areas  in  the  valleys  adjacent  to  high-intensity  regions  are 
homogeneous  in  that  the  resolution  cells  in  those  areas  exhibit  locally 
equivalent  probability  distributions.  Such  information  suggests  that 
some  common  mechanism  or  effect  is  responsible  for  the  equivalency  of 
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the  cellular  distributions.  One  possibility  that  may  warrant  further 
investigation  is  that  the  convolution  of  the  compressed  pulse  shape  with 
the  hilltops  produces  notable  sidelobes  which  may  cause  homogeneous 
clutter  distributions  within  the  resolution  cells  wherein  the  sidelobes 
reside. 

Further  testing  of  the  isodistributive  definition  is  necessary . 
Additional  backscatter  data  sets  of  West  Stockbridge  Hill,  as  well  as 
other  similarly  hilly  areas,  are  required  to  determine  whether  this 
definition  consistently  corresponds  to  slopes  of  hills  over  temporal  and 
spatial  differences.  If  it  proves  to  be  consistent,  then  then  the 
definition  will  be  particularly  applicable  to  radar  signal  processing 
techniques,  such  as  CFAR,  which  utilize  statistical  clutter  information 
to  reject  clutter. 


Section  6.2-  Suggestions  for  Future  Research 

Several  possible  new  directions  may  be  taken  in  solving  the  problem 
of  suitably  defining  homogeneous  regions.  In  Figure  6.1,  the  mean  value 
of  the  West  Stockbridge  Hill  data  is  displayed  in  a  three-dimensional 
plot.  Figure  6.2  shows  the  variance  plot  of  the  same  data  set.  It 
appears  that  some  positive  spatial  correlation  exists  between  the  mean 
and  variance  of  the  West  Stockbridge  Hill  data  set.  In  Figure  6.3,  the 
normalized  standard  deviation  is  displayed,  where  the  normalization  is 
to  the  mean  value  of  the  backscatter  in  the  resolution  cell.  These 


observations  are  consistent  with  the  non-homogeneous ,  Rayleigh  clutter 
model  discussed  by  Szajnowski  [12].  Further  work  along  this  line  may  be 
useful  in  relation  to  the  isodistributive  definition  of  homogeneity 
which  requires  a  more  formal  method  of  preprocessing.  In  addition  to 
the  aforementioned  processing,  the  third  data  set  of  coherently  detected 
data,  taken  in  the  West  Stockbridge  Hill  area,  has  been  doppler 
processed  and  the  corresponding  doppler  spreads  have  been  computed  and 
are  shown  in  Figure  6.4,  where  long  arcs  on  the  radial  graph  denote 
large  doppler  spreads  and  short  arcs  indicate  smaller  spreads.  Some 
correlation  of  the  large  doppler  spreads  with  large  mean  backscatter 
return  can  be  seen.  Further  work  in  discovering  the  cause  of  this 
correlation  would  require  study  of  the  ground  cover  in  these  areas  of 
large  doppler  spread  and  a  corresponding  theoretical  analysis  of  the 
doppler  spreads  which  could  be  expected. 

In  continuing  along  the  lines  of  research  in  the  present  study,  the 
next  iteration  of  work  should  include  the  investigation  of  methods  for 
collating  the  topographic  maps  with  the  clutter  maps  so  that  spatially 
accurate  comparisons  of  topographic  features  and  the  homgeneous  regions 
located  by  both  the  rough  clutter  definition  and  the  isodistributive 
definition  may  be  made.  This  is  by  no  means  a  simple  task.  The 
automatic  collation  of  maps  remains  as  one  of  the  unsolved  problems  in 
pattern  recognition.  As  a  practical  example  of  this  problem,  attempts 
to  find  a  point  of  reference  in  the  West  Stockbridge  Hill  clutter  data 
failed.  There  are  several  large  radio  towers  on  the  crest  of  the  hill 
which  should  be  easily  detectable  through  the  fact  that  their 
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backscatter  returns  should  be  of  high  mean  value  and  low  variance. 
However,  no  such  points  could  be  successfully  located.  The  solution  to 
this  map  collation  problem  may  lie  in  the  utilization  of  digitized 
maps.  Such  maps  include  specific  location  information  and,  in  some 
cases,  also  contain  a  great  deal  of  topographic  information.  These  maps 
would  prove  useful  in  solving  both  the  collation  problem  and  the 
topography-clutter  relationship  problem.  However,  difficulties  in 
aligning  the  topographic  maps  and  the  clutter  maps  may  still  occur  and 
by  no  means  are  digitized  maps  the  complete  solution  to  the 
topography-clutter  relationship  problem. 

The  solution  to  the  topography-clutter  problem  lies  in  the 
successful  collaboration  of  researchers  who  approach  the  problem  from 
the  direction  taken  in  this  work  and  those  who  attempt  to  statistically 
model  the  radar  environment  and  formulate  backscatter  models. 
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Appendix  A 


The  derivation  of  the  equivalent  noise  bandwidth  for  the 
two-dimensional  lag  window  is  given  in  the  following. 

It  is  desired  that  the  expected  value  of  the  spectral  estimate  be 
equal  to  the  actual  spectral  value: 

E[C(k,l>]  =  r  (k, .')  • 


The  windowed  spectral  estimate  is 


E[ C (k, £)  ]  =  E[C (k , £) ] *W(k, A) . 


(A. 2) 


Substituting  (A.l)  in  (A. 2),  we  have  the  desired  result 

E(C(k,  t)l  =  r(k,it)*W(k,SO. 


(A. 3) 


Expanding  the  discrete  two-dimensional  convolution,  we  obtain 

00  CO 

r(k,f,)*w(k,t)  =  I  £  r(v,n)W(k-v,t-n).  (a. 4) 

^=—oo  \)  =  —00 
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Now,  we  desire  that 


r  (k,o*w(k,o  =  r(k,A). 


(A. 5) 


Assuming  that  the  bandwidth  of  the  window  is  narrow  compared  to  the 
actual  spectrum,  we  may  treat  r  as  a  constant,  so  that 


r  I  2  W(k-v ,  n-n)  =  E[C  (k,  S.)  ] .  (A. 6) 

f]  =  _0O  \)  —  —CC 


This  implies  that 


00  OO 

E  E  W(k-v ,  H-n)  =  1  (A. 7) 

)^  =  -oo  y =— CD 


which  is  to  say  that  the  volume  under  our  two-dimensional 
two-dimensional  window  should  be  equal  to  unity.  The  equivalent  window 
should  be  a  right  circular  cylinder  of  radius  B  and  height  given  by 

W2(0,0)  *  ~  .  (A. 8) 

ttB 


To  obtain  the  equivalent  noise  bandwidth,  B,  of  the  window,  we  use 
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the  expression 


iB 


N  N 

i  z  wz(k,s>) 

2  =  N  k=l  1=1 _ 


(A. 9) 


N 


I  w2(0,0) 


Using  Parseval's  theorem  and  substituting  for  W  (0,0),  (A. 9)  becomes 


nB2  = 


N2  Z  Z  w2(m,n) 
m  n 


(A. 10) 


But  we  note  from  (A. 7)  that 


w(o ,0)  =  -4 1 1  w<k.*>  -  -4 


(A. 11) 


N  k  S’ 


Since  in  our  case  W(0,0)=1.  We  must  multiply  the  entire  sequence  in 
(A. 10)  by  1/N2,  so  that  finally 


B 


_ N2/tt 

N  N  £ 

Z  £  w  (m,n) 
m=l  n=l 


1/2 


(A. 12) 


In  our  case  2B=3.13. 

A  three-dimensional  plot  of  the  two-dimensional  lag  window  for  a 
length  of  2  points,  which  is  that  utilized  in  this  thesis,  is  displayed 
in  Figure  A.l . 
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Appendix  B 


The  following  is  a  brief  explanation  of  the  chi-squared 
approximation  to  the  null  distribution  of  the  test  statistic  H  used  in 
the  Kruskal-Wallis  test. 

Non-parametric  tests  tend  to  use  information  that  is  strictly 
sample-related.  Therefore,  the  null  hypothesis,  and  the  assumptions 
which  are  tested,  are  indirectly  related  to  a  population-related 
hypothesis.  This  hypothesis  may  be  logically  connected  to  the  null,  or 
sample-related,  hypothesis.  In  the  present  situation,  the 
population-related  hypothesis  is  that  the  C  populations,  from  which  the 
C  sets  of  samples  are  drawn,  are  identically  distributed.  The 
sample-related  hypothesis  is  that  there  are  equal  a  priori  probabilities 
for  each  of  the  distinguishable  divisions  of  the  N  ranks  into  C  columns 
of  R  ranks  per  column.  The  total  number  of  such  divisions  is  given  by: 

D=  N!/(R!)C  (B.l ) 


from  the  rules  for  permutations  and  combinations  [13]. 


It  is  desirable  for  the 
supplied  by  the  column  rank  sums. 


test  statistic  to  include  information 
Let  us  choose 
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mm am, 


(B.2) 


C 

H'  =  Z 
i=l 


Ri  Ti 


as  the  variable  part  of  the  test  statistic.  Furthermore,  let  us  assume 
that  the  R^  are  all  equal.  Equation  (B.2)  becomes 


H' 


—  2 

Z  R  T  . 
i=l 


(B.3) 


Now,  is  the  sum  of  R  observations  made  on  a  finite  population,  the 

ranks  from  1  to  N.  The  mean  of  this  finite  population  is 


n 


(B.4) 


and  the  population  variance  is 


2 

o 


N 


(B.5) 


Under  the  null  hypothesis,  the  R  observations,  which  sum  to  T^,  will  be 
randomly  drawn  without  replacement  from  the  finite  population  of  ranks. 
In  this  situation,  as  a  variate  will  have  a  mean  given  by 

u  ,  RXN+D  ( B.6 ) 

s  2 
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and  a  variance 


2  =  R(N2-1)  (N-R) 
s  12  (N-l) 


(B.7) 


where  the  final  fraction  in  equation  (B.7)  is  the  correction  factor  for 
sampling  without  replacement.  By  virtue  of  the  Central  Limit  Theorem, 
the  distribution  of  the  variate,  T^,  will  approach  a  normal  distribution 
as  R  increases.  The  quantity 

T  -  [R(N+l)/2] 

[R(N2-1)/12] [N-R)/ (N— 1 ) ] 

may  be  treated  as  a  variate  with  normal  distribution,  N(0,1).  The  sum 
of  the  squares  of  C  of  the  above  unit  normal  variates  would  be 
distributed  as  chi-squared  with  C  degrees  of  freedom.  However,  only  C-l 
of  the  rank  sums  are  independent,  since  the  last  T^  is  determined  by 
elimination  of  the  ranks  summing  to  the  previous  T^'s.  Therefore,  only 
C-l  of  the  T^  considered  may  be  regarded  as  independent.  Hence,  C-l/C 
of  the  sum  of  the  squares  of  T^  is  distributed  as  chi-squared  with  C-l 
degrees  of  freedom  as  in 

2  C-l  C  fTi"lR(N+l)/2]  i2 

X  "  ~  R(N+1)(N-R)/12  (B'8) 


Rearranging  equation  (B.8),  we  have 
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aa 


2 

x 


12(C-1) 
C(N-R) (N+l) 


C 

r 

1=1 


{T.  - 
1 


[R(N+L)/12]  }' 


(B.9) 


Noting  that 


T. 

l 


=  R  T. 


l 


(B .10) 


and 


N=RC 


(B.ll) 


and  substituting  equations  (B.10)  and  (B.ll)  in  (B.9)  we  have, 


2 


x 


~  12 

=  N (N+l) 


C 

I 

i=l 


N+l.  2 
2  ; 


(B.12) 


which  is  equal  to  H  as  in  equation  (B.l). 
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